Semi-Supervised Noisy Label Learning for Chinese Medical Named Entity Recognition

نویسندگان

چکیده

This paper describes our approach for the Chinese clinical named entity recognition (CNER) task organized by 2020 China Conference on Knowledge Graph and Semantic Computing (CCKS) competition. In this task, we need to identify boundary category labels of six entities from electronic medical record (EMR). We constructed a hybrid system composed semi-supervised noisy label learning model based adversarial training rule post-processing module. The core idea is reduce impact data noise optimizing results. Besides, used rules correct three cases redundant labeling, missing wrong labeling in prediction Our method proposed achieved strict criteria 0.9156 relax 0.9660 final test set, ranking first.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-supervised Named Entity Recognition in noisy-text

Many of the existing Named Entity Recognition (NER) solutions are built based on news corpus data with proper syntax. These solutions might not lead to highly accurate results when being applied to noisy, user generated data, e.g., tweets, which can feature sloppy spelling, concept drift, and limited contextualization of terms and concepts due to length constraints. The models described in this...

متن کامل

A Semi-supervised Learning Approach to Arabic Named Entity Recognition

We present ASemiNER, a semisupervised algorithm for identifying Named Entities (NEs) in Arabic text. ASemiNER does not require annotated training data, or gazetteers. It also can be easily adapted to handle more than the three standard NE types (Person, Location, and Organisation). To our knowledge, our algorithm is the first study that intensively investigates the semi-supervised pattern-based...

متن کامل

Semi-supervised Bio-named Entity Recognition with Word-Codebook Learning

We describe a novel semi-supervised method called WordCodebook Learning (WCL), and apply it to the task of bionamed entity recognition (bioNER). Typical bioNER systems can be seen as tasks of assigning labels to words in bioliterature text. To improve supervised tagging, WCL learns a class of word-level feature embeddings to capture word semantic meanings or word label patterns from a large unl...

متن کامل

Semi-Supervised Learning of Named Entity Substructure

The goal of this project was two-fold: (1) to provide an algorithm to correctly find and label named entities in text, and (2) to uncover substructure in the named entities (such as a first name, last name distinction among person entities). The underlying algorithm used is a Class Hidden Markov Model (CHMM), a Hidden Markov Model with hidden states that emit observed words as well as observed ...

متن کامل

Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model

Named entity recognition (NER) plays an important role in the NLP literature. The traditional methods tend to employ large annotated corpus to achieve a high performance. Different with many semi-supervised learning models for NER task, in this paper, we employ the graph-based semi-supervised learning (GBSSL) method to utilize the freely available unlabeled data. The experiment shows that the u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data intelligence

سال: 2021

ISSN: ['2096-7004', '2641-435X']

DOI: https://doi.org/10.1162/dint_a_00099